ShAPO: Implicit Representations for Multi-object Shape, Appearance, and Pose Optimization

نویسندگان

چکیده

Our method studies the complex task of object-centric 3D understanding from a single RGB-D observation. As it is an ill-posed problem, existing methods suffer low performance for both shape and 6D pose size estimation in multi-object scenarios with occlusions. We present ShAPO, joint detection, textured reconstruction, object estimation. Key to ShAPO single-shot pipeline regress shape, appearance latent codes along masks each instance, which then further refined sparse-to-dense fashion. A novel disentangled database priors first learned embed objects their respective space. also propose novel, octree-based differentiable optimization step, allowing us improve simultaneously under space, analysis-by-synthesis implicit representation allows accurately identify reconstruct unseen without having access meshes. Through extensive experiments, we show that our method, trained on simulated indoor scenes, regresses real-world minimal fine-tuning. significantly out-performs all baselines NOCS dataset 8% absolute improvement mAP

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Shape Representations for Object Recognition

The problem of object recognition has been at the forefront of computer vision research in the last decade. The most successful approaches have used mainly edgeor texture-based representations. The shape of the object outline, albeit widely used for pre-segmented objects, has found limited applicability to the detection problem in real images. The fact that shape is a truly holistic global perc...

متن کامل

Object Discovery through Motion, Appearance and Shape

We examine the problem of Object Discovery, the autonomous acquisition of object models, using a combination of shape, appearance and motion. We propose a novel multi-stage technique for detecting rigidly moving objects and modeling their appearance for recognition. First, a stereo camera is used to acquire a sequence of images and depth maps of a given scene. Then the scene is oversegmented us...

متن کامل

Pose Estimation and Object Identiication Using Complex Algebraic Representations Pose Estimation and Object Identiication Using Complex Algebraic Representations

The comparison and alignment of two similar objects is a fundamental problem in pattern recognition and computer vision that has been considered using various approaches. In this work, we employ a complex representation for an algebraic curve, and illustrate how the algebraic transformation which relates two Euclidean equivalent curves can be determined using this representation. The idea is ba...

متن کامل

Modeling Pose/Appearance Relations for Improved Object Localization and Pose Estimation in 2D images

We propose a multiview model of appearance of objects that explicitly represents their variations of appearance with respect to their 3D pose. This results in a probabilistic, generative model capable of precisely synthesizing novel views of the learned object in arbitrary poses, not limited to the discrete set of trained viewpoints. We show how to use this model on the task of localization and...

متن کامل

TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-class Object Recognition and Segmentation

This paper proposes a new approach to learning a discriminative model of object classes, incorporating appearance, shape and context information efficiently. The learned model is used for automatic visual recognition and semantic segmentation of photographs. Our discriminative model exploits novel features, based on textons, which jointly model shape and texture. Unary classification and featur...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-20086-1_16